Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 15866 |
| Missing cells | 623 |
| Missing cells (%) | 0.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.7 MiB |
| Average record size in memory | 112.0 B |
Variable types
| NUM | 10 |
|---|---|
| CAT | 4 |
brewery_name has a high cardinality: 1753 distinct values | High cardinality |
review_profilename has a high cardinality: 5309 distinct values | High cardinality |
beer_style has a high cardinality: 103 distinct values | High cardinality |
beer_name has a high cardinality: 6526 distinct values | High cardinality |
beer_abv has 621 (3.9%) missing values | Missing |
Unnamed: 0 has unique values | Unique |
Reproduction
| Analysis started | 2020-11-04 01:58:29.517251 |
|---|---|
| Analysis finished | 2020-11-04 01:58:48.489668 |
| Duration | 18.97 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 15866 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 792166.7627 |
|---|---|
| Minimum | 12 |
| Maximum | 1586529 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 124.0 KiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 77447.25 |
| Q1 | 399948 |
| median | 791572.5 |
| Q3 | 1193125.5 |
| 95-th percentile | 1506580.5 |
| Maximum | 1586529 |
| Range | 1586517 |
| Interquartile range (IQR) | 793177.5 |
Descriptive statistics
| Standard deviation | 458183.8024 |
|---|---|
| Coefficient of variation (CV) | 0.5783931161 |
| Kurtosis | -1.203072527 |
| Mean | 792166.7627 |
| Median Absolute Deviation (MAD) | 396346.5 |
| Skewness | -0.001417095701 |
| Sum | 1.256851786e+10 |
| Variance | 2.099323967e+11 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 299006 | 1 | < 0.1% | |
| 1540447 | 1 | < 0.1% | |
| 1383126 | 1 | < 0.1% | |
| 1520339 | 1 | < 0.1% | |
| 926417 | 1 | < 0.1% | |
| 283343 | 1 | < 0.1% | |
| 1131213 | 1 | < 0.1% | |
| 1434313 | 1 | < 0.1% | |
| 748232 | 1 | < 0.1% | |
| 1481414 | 1 | < 0.1% | |
| Other values (15856) | 15856 | 99.9% |
| Value | Count | Frequency (%) | |
| 12 | 1 | < 0.1% | |
| 119 | 1 | < 0.1% | |
| 224 | 1 | < 0.1% | |
| 304 | 1 | < 0.1% | |
| 343 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1586529 | 1 | < 0.1% | |
| 1586364 | 1 | < 0.1% | |
| 1586352 | 1 | < 0.1% | |
| 1586343 | 1 | < 0.1% | |
| 1586248 | 1 | < 0.1% |
brewery_id
Real number (ℝ≥0)
| Distinct | 1769 |
|---|---|
| Distinct (%) | 11.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3183.496596 |
|---|---|
| Minimum | 1 |
| Maximum | 27800 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 124.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 30 |
| Q1 | 142 |
| median | 423 |
| Q3 | 2391 |
| 95-th percentile | 16866 |
| Maximum | 27800 |
| Range | 27799 |
| Interquartile range (IQR) | 2249 |
Descriptive statistics
| Standard deviation | 5689.24471 |
|---|---|
| Coefficient of variation (CV) | 1.787105636 |
| Kurtosis | 3.356527267 |
| Mean | 3183.496596 |
| Median Absolute Deviation (MAD) | 360 |
| Skewness | 2.076065472 |
| Sum | 50509357 |
| Variance | 32367505.37 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 35 | 390 | 2.5% | |
| 10099 | 316 | 2.0% | |
| 147 | 316 | 2.0% | |
| 140 | 278 | 1.8% | |
| 132 | 231 | 1.5% | |
| 287 | 231 | 1.5% | |
| 1199 | 207 | 1.3% | |
| 220 | 190 | 1.2% | |
| 345 | 190 | 1.2% | |
| 29 | 166 | 1.0% | |
| Other values (1759) | 13351 | 84.1% |
| Value | Count | Frequency (%) | |
| 1 | 10 | 0.1% | |
| 3 | 47 | 0.3% | |
| 4 | 79 | 0.5% | |
| 5 | 14 | 0.1% | |
| 6 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 27800 | 1 | < 0.1% | |
| 27681 | 1 | < 0.1% | |
| 27087 | 1 | < 0.1% | |
| 27039 | 8 | 0.1% | |
| 26990 | 1 | < 0.1% |
| Distinct | 1753 |
|---|---|
| Distinct (%) | 11.0% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 124.0 KiB |
| Boston Beer Company (Samuel Adams) | 390 |
|---|---|
| Stone Brewing Co. | 316 |
| Dogfish Head Brewery | 316 |
| Sierra Nevada Brewing Co. | 278 |
| Rogue Ales | 231 |
| Other values (1748) |
| Value | Count | Frequency (%) | |
| Boston Beer Company (Samuel Adams) | 390 | 2.5% | |
| Stone Brewing Co. | 316 | 2.0% | |
| Dogfish Head Brewery | 316 | 2.0% | |
| Sierra Nevada Brewing Co. | 278 | 1.8% | |
| Rogue Ales | 231 | 1.5% | |
| Bell's Brewery, Inc. | 231 | 1.5% | |
| Founders Brewing Company | 207 | 1.3% | |
| Lagunitas Brewing Company | 190 | 1.2% | |
| Victory Brewing Company | 190 | 1.2% | |
| Avery Brewing Company | 166 | 1.0% | |
| Other values (1743) | 13350 | 84.1% |
Unique
| Unique | 705 ? |
|---|---|
| Unique (%) | 4.4% |
Length
| Max length | 66 |
|---|---|
| Median length | 23 |
| Mean length | 23.69128955 |
| Min length | 3 |
review_time
Real number (ℝ≥0)
| Distinct | 15865 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1225493757 |
|---|---|
| Minimum | 894931201 |
| Maximum | 1326251972 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 124.0 KiB |
Quantile statistics
| Minimum | 894931201 |
|---|---|
| 5-th percentile | 1073805417 |
| Q1 | 1174793682 |
| median | 1241334608 |
| Q3 | 1289233383 |
| 95-th percentile | 1318747448 |
| Maximum | 1326251972 |
| Range | 431320771 |
| Interquartile range (IQR) | 114439701.2 |
Descriptive statistics
| Standard deviation | 76132666.97 |
|---|---|
| Coefficient of variation (CV) | 0.06212407576 |
| Kurtosis | -0.2562526226 |
| Mean | 1225493757 |
| Median Absolute Deviation (MAD) | 53092013 |
| Skewness | -0.7620441542 |
| Sum | 1.944368394e+13 |
| Variance | 5.79618298e+15 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1290897310 | 2 | < 0.1% | |
| 1177038841 | 1 | < 0.1% | |
| 1217819467 | 1 | < 0.1% | |
| 1303083850 | 1 | < 0.1% | |
| 1319068489 | 1 | < 0.1% | |
| 1137384262 | 1 | < 0.1% | |
| 1232747330 | 1 | < 0.1% | |
| 1300144961 | 1 | < 0.1% | |
| 1258826560 | 1 | < 0.1% | |
| 1228915517 | 1 | < 0.1% | |
| Other values (15855) | 15855 | 99.9% |
| Value | Count | Frequency (%) | |
| 894931201 | 1 | < 0.1% | |
| 908236801 | 1 | < 0.1% | |
| 917308801 | 1 | < 0.1% | |
| 984787201 | 1 | < 0.1% | |
| 993912246 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1326251972 | 1 | < 0.1% | |
| 1326232870 | 1 | < 0.1% | |
| 1326227237 | 1 | < 0.1% | |
| 1326216953 | 1 | < 0.1% | |
| 1326185385 | 1 | < 0.1% |
review_overall
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.823490483 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 124.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4.5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7112147028 |
|---|---|
| Coefficient of variation (CV) | 0.186011893 |
| Kurtosis | 1.627512023 |
| Mean | 3.823490483 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -1.011864773 |
| Sum | 60663.5 |
| Variance | 0.5058263534 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4 | 5910 | 37.2% | |
| 4.5 | 3231 | 20.4% | |
| 3.5 | 3023 | 19.1% | |
| 3 | 1612 | 10.2% | |
| 5 | 918 | 5.8% | |
| 2.5 | 581 | 3.7% | |
| 2 | 373 | 2.4% | |
| 1.5 | 122 | 0.8% | |
| 1 | 96 | 0.6% |
| Value | Count | Frequency (%) | |
| 1 | 96 | 0.6% | |
| 1.5 | 122 | 0.8% | |
| 2 | 373 | 2.4% | |
| 2.5 | 581 | 3.7% | |
| 3 | 1612 | 10.2% |
| Value | Count | Frequency (%) | |
| 5 | 918 | 5.8% | |
| 4.5 | 3231 | 20.4% | |
| 4 | 5910 | 37.2% | |
| 3.5 | 3023 | 19.1% | |
| 3 | 1612 | 10.2% |
review_aroma
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.74454809 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 124.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 4.5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.6929546586 |
|---|---|
| Coefficient of variation (CV) | 0.1850569526 |
| Kurtosis | 1.257369698 |
| Mean | 3.74454809 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.8668971118 |
| Sum | 59411 |
| Variance | 0.4801861588 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4 | 5651 | 35.6% | |
| 3.5 | 3671 | 23.1% | |
| 4.5 | 2737 | 17.3% | |
| 3 | 1922 | 12.1% | |
| 2.5 | 646 | 4.1% | |
| 5 | 639 | 4.0% | |
| 2 | 400 | 2.5% | |
| 1.5 | 132 | 0.8% | |
| 1 | 68 | 0.4% |
| Value | Count | Frequency (%) | |
| 1 | 68 | 0.4% | |
| 1.5 | 132 | 0.8% | |
| 2 | 400 | 2.5% | |
| 2.5 | 646 | 4.1% | |
| 3 | 1922 | 12.1% |
| Value | Count | Frequency (%) | |
| 5 | 639 | 4.0% | |
| 4.5 | 2737 | 17.3% | |
| 4 | 5651 | 35.6% | |
| 3.5 | 3671 | 23.1% | |
| 3 | 1922 | 12.1% |
review_appearance
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.845833859 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 124.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 4.5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.6081135223 |
|---|---|
| Coefficient of variation (CV) | 0.1581226711 |
| Kurtosis | 1.808656181 |
| Mean | 3.845833859 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.9120150828 |
| Sum | 61018 |
| Variance | 0.369802056 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4 | 6827 | 43.0% | |
| 3.5 | 3172 | 20.0% | |
| 4.5 | 2890 | 18.2% | |
| 3 | 1658 | 10.5% | |
| 5 | 625 | 3.9% | |
| 2.5 | 350 | 2.2% | |
| 2 | 259 | 1.6% | |
| 1.5 | 52 | 0.3% | |
| 1 | 33 | 0.2% |
| Value | Count | Frequency (%) | |
| 1 | 33 | 0.2% | |
| 1.5 | 52 | 0.3% | |
| 2 | 259 | 1.6% | |
| 2.5 | 350 | 2.2% | |
| 3 | 1658 | 10.5% |
| Value | Count | Frequency (%) | |
| 5 | 625 | 3.9% | |
| 4.5 | 2890 | 18.2% | |
| 4 | 6827 | 43.0% | |
| 3.5 | 3172 | 20.0% | |
| 3 | 1658 | 10.5% |
| Distinct | 5309 |
|---|---|
| Distinct (%) | 33.5% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 124.0 KiB |
| northyorksammy | 61 |
|---|---|
| Thorpe429 | 52 |
| BuckeyeNation | 52 |
| mikesgroove | 45 |
| ChainGangGuy | 43 |
| Other values (5304) |
| Value | Count | Frequency (%) | |
| northyorksammy | 61 | 0.4% | |
| Thorpe429 | 52 | 0.3% | |
| BuckeyeNation | 52 | 0.3% | |
| mikesgroove | 45 | 0.3% | |
| ChainGangGuy | 43 | 0.3% | |
| Billolick | 38 | 0.2% | |
| TheManiacalOne | 38 | 0.2% | |
| akorsak | 37 | 0.2% | |
| drabmuh | 34 | 0.2% | |
| Mora2000 | 34 | 0.2% | |
| Other values (5299) | 15431 | 97.3% |
Unique
| Unique | 2755 ? |
|---|---|
| Unique (%) | 17.4% |
Length
| Max length | 16 |
|---|---|
| Median length | 9 |
| Mean length | 8.975797302 |
| Min length | 3 |
| Distinct | 103 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 124.0 KiB |
| American IPA | 1159 |
|---|---|
| American Double / Imperial IPA | 883 |
| American Pale Ale (APA) | 619 |
| American Porter | 550 |
| American Double / Imperial Stout | 524 |
| Other values (98) |
| Value | Count | Frequency (%) | |
| American IPA | 1159 | 7.3% | |
| American Double / Imperial IPA | 883 | 5.6% | |
| American Pale Ale (APA) | 619 | 3.9% | |
| American Porter | 550 | 3.5% | |
| American Double / Imperial Stout | 524 | 3.3% | |
| Russian Imperial Stout | 520 | 3.3% | |
| American Amber / Red Ale | 446 | 2.8% | |
| Belgian Strong Dark Ale | 356 | 2.2% | |
| Fruit / Vegetable Beer | 350 | 2.2% | |
| Saison / Farmhouse Ale | 335 | 2.1% | |
| Other values (93) | 10124 | 63.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 35 |
|---|---|
| Median length | 18 |
| Mean length | 17.81488718 |
| Min length | 4 |
review_palate
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.746785579 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 124.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 4.5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.6732922656 |
|---|---|
| Coefficient of variation (CV) | 0.179698638 |
| Kurtosis | 1.311415167 |
| Mean | 3.746785579 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.8658062548 |
| Sum | 59446.5 |
| Variance | 0.4533224749 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4 | 6155 | 38.8% | |
| 3.5 | 3412 | 21.5% | |
| 4.5 | 2502 | 15.8% | |
| 3 | 2033 | 12.8% | |
| 2.5 | 631 | 4.0% | |
| 5 | 599 | 3.8% | |
| 2 | 366 | 2.3% | |
| 1.5 | 108 | 0.7% | |
| 1 | 60 | 0.4% |
| Value | Count | Frequency (%) | |
| 1 | 60 | 0.4% | |
| 1.5 | 108 | 0.7% | |
| 2 | 366 | 2.3% | |
| 2.5 | 631 | 4.0% | |
| 3 | 2033 | 12.8% |
| Value | Count | Frequency (%) | |
| 5 | 599 | 3.8% | |
| 4.5 | 2502 | 15.8% | |
| 4 | 6155 | 38.8% | |
| 3.5 | 3412 | 21.5% | |
| 3 | 2033 | 12.8% |
review_taste
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.802880373 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 124.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4.5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7247920843 |
|---|---|
| Coefficient of variation (CV) | 0.1905902929 |
| Kurtosis | 1.358259339 |
| Mean | 3.802880373 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.9601350005 |
| Sum | 60336.5 |
| Variance | 0.5253235655 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4 | 5368 | 33.8% | |
| 4.5 | 3413 | 21.5% | |
| 3.5 | 3292 | 20.7% | |
| 3 | 1657 | 10.4% | |
| 5 | 861 | 5.4% | |
| 2.5 | 638 | 4.0% | |
| 2 | 412 | 2.6% | |
| 1.5 | 128 | 0.8% | |
| 1 | 97 | 0.6% |
| Value | Count | Frequency (%) | |
| 1 | 97 | 0.6% | |
| 1.5 | 128 | 0.8% | |
| 2 | 412 | 2.6% | |
| 2.5 | 638 | 4.0% | |
| 3 | 1657 | 10.4% |
| Value | Count | Frequency (%) | |
| 5 | 861 | 5.4% | |
| 4.5 | 3413 | 21.5% | |
| 4 | 5368 | 33.8% | |
| 3.5 | 3292 | 20.7% | |
| 3 | 1657 | 10.4% |
| Distinct | 6526 |
|---|---|
| Distinct (%) | 41.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 124.0 KiB |
| Old Rasputin Russian Imperial Stout | 42 |
|---|---|
| 90 Minute IPA | 35 |
| Sierra Nevada Celebration Ale | 34 |
| India Pale Ale | 33 |
| Pale Ale | 32 |
| Other values (6521) |
| Value | Count | Frequency (%) | |
| Old Rasputin Russian Imperial Stout | 42 | 0.3% | |
| 90 Minute IPA | 35 | 0.2% | |
| Sierra Nevada Celebration Ale | 34 | 0.2% | |
| India Pale Ale | 33 | 0.2% | |
| Pale Ale | 32 | 0.2% | |
| Stone Ruination IPA | 32 | 0.2% | |
| Sierra Nevada Pale Ale | 31 | 0.2% | |
| Two Hearted Ale | 30 | 0.2% | |
| Ayinger Celebrator Doppelbock | 30 | 0.2% | |
| Founders KBS (Kentucky Breakfast Stout) | 29 | 0.2% | |
| Other values (6516) | 15538 | 97.9% |
Unique
| Unique | 3951 ? |
|---|---|
| Unique (%) | 24.9% |
Length
| Max length | 74 |
|---|---|
| Median length | 19 |
| Mean length | 20.56271272 |
| Min length | 2 |
| Distinct | 254 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 621 |
| Missing (%) | 3.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.045513283 |
|---|---|
| Minimum | 0.05 |
| Maximum | 41 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 124.0 KiB |
Quantile statistics
| Minimum | 0.05 |
|---|---|
| 5-th percentile | 4.5 |
| Q1 | 5.2 |
| median | 6.5 |
| Q3 | 8.5 |
| 95-th percentile | 11 |
| Maximum | 41 |
| Range | 40.95 |
| Interquartile range (IQR) | 3.3 |
Descriptive statistics
| Standard deviation | 2.320866046 |
|---|---|
| Coefficient of variation (CV) | 0.3294104989 |
| Kurtosis | 10.5706658 |
| Mean | 7.045513283 |
| Median Absolute Deviation (MAD) | 1.5 |
| Skewness | 1.762830752 |
| Sum | 107408.85 |
| Variance | 5.386419202 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 5 | 1078 | 6.8% | |
| 8 | 705 | 4.4% | |
| 6 | 642 | 4.0% | |
| 7 | 616 | 3.9% | |
| 9 | 601 | 3.8% | |
| 5.5 | 561 | 3.5% | |
| 10 | 532 | 3.4% | |
| 6.5 | 480 | 3.0% | |
| 5.2 | 434 | 2.7% | |
| 7.5 | 433 | 2.7% | |
| Other values (244) | 9163 | 57.8% | |
| (Missing) | 621 | 3.9% |
| Value | Count | Frequency (%) | |
| 0.05 | 1 | < 0.1% | |
| 0.45 | 1 | < 0.1% | |
| 0.5 | 4 | < 0.1% | |
| 1.2 | 1 | < 0.1% | |
| 1.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 41 | 2 | < 0.1% | |
| 32 | 2 | < 0.1% | |
| 27 | 2 | < 0.1% | |
| 26 | 2 | < 0.1% | |
| 19.5 | 1 | < 0.1% |
beer_beerid
Real number (ℝ≥0)
| Distinct | 6752 |
|---|---|
| Distinct (%) | 42.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22027.5 |
|---|---|
| Minimum | 5 |
| Maximum | 76814 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 124.0 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 219 |
| Q1 | 1769 |
| median | 14925.5 |
| Q3 | 39621 |
| 95-th percentile | 62999 |
| Maximum | 76814 |
| Range | 76809 |
| Interquartile range (IQR) | 37852 |
Descriptive statistics
| Standard deviation | 21902.53081 |
|---|---|
| Coefficient of variation (CV) | 0.9943266739 |
| Kurtosis | -0.8454913006 |
| Mean | 22027.5 |
| Median Absolute Deviation (MAD) | 14149.5 |
| Skewness | 0.6717315153 |
| Sum | 349488315 |
| Variance | 479720855.8 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 412 | 42 | 0.3% | |
| 2093 | 35 | 0.2% | |
| 1904 | 34 | 0.2% | |
| 4083 | 32 | 0.2% | |
| 276 | 31 | 0.2% | |
| 1093 | 30 | 0.2% | |
| 131 | 30 | 0.2% | |
| 680 | 29 | 0.2% | |
| 19960 | 29 | 0.2% | |
| 92 | 27 | 0.2% | |
| Other values (6742) | 15547 | 98.0% |
| Value | Count | Frequency (%) | |
| 5 | 2 | < 0.1% | |
| 6 | 2 | < 0.1% | |
| 7 | 5 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 76814 | 1 | < 0.1% | |
| 76813 | 1 | < 0.1% | |
| 76803 | 1 | < 0.1% | |
| 76756 | 1 | < 0.1% | |
| 76726 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| Unnamed: 0 | brewery_id | brewery_name | review_time | review_overall | review_aroma | review_appearance | review_profilename | beer_style | review_palate | review_taste | beer_name | beer_abv | beer_beerid | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1232172 | 10001 | Hockley Valley Brewing Co. | 1268863522 | 3.5 | 3.5 | 3.0 | MattyV | Irish Dry Stout | 2.0 | 3.0 | Hockley Stout | 4.6 | 35859 |
| 1 | 521244 | 113 | Samuel Smith Old Brewery (Tadcaster) | 1209435786 | 4.5 | 4.0 | 4.5 | bluegrassbrew | English Porter | 4.0 | 4.0 | Samuel Smith's, The Famous Taddy Porter | 5.0 | 572 |
| 2 | 1098847 | 418 | Left Hand Brewing Company | 1309720985 | 3.5 | 3.5 | 3.5 | DrJay | English India Pale Ale (IPA) | 3.5 | 3.5 | 400 Pound Monkey | 6.7 | 44706 |
| 3 | 246137 | 607 | High Point Brewing Company | 1063648679 | 4.5 | 4.5 | 4.0 | Dantes | Märzen / Oktoberfest | 4.5 | 4.0 | Ramstein Oktoberfest | 6.0 | 12718 |
| 4 | 1260943 | 112 | North Coast Brewing Co. | 1269408994 | 4.0 | 4.0 | 3.5 | nickfl | German Pilsener | 3.5 | 3.5 | Scrimshaw Pilsner | 4.4 | 409 |
| 5 | 1519120 | 735 | 21st Amendment Brewery | 1310115300 | 3.5 | 3.5 | 3.5 | rootbeerman | American Double / Imperial IPA | 3.0 | 3.5 | Hop Crisis | 9.7 | 42063 |
| 6 | 1350294 | 7944 | Ridgeway Brewing | 1261368179 | 4.0 | 3.5 | 3.5 | Richardberg | Foreign / Export Stout | 4.0 | 4.0 | Lump Of Coal | 8.0 | 20905 |
| 7 | 218892 | 158 | Great Divide Brewing Company | 1322882947 | 4.0 | 4.0 | 4.0 | SolipsismalCat | Old Ale | 4.0 | 4.0 | Hibernation Ale | 8.7 | 1446 |
| 8 | 1057389 | 200 | Mendocino Brewing Company | 1227070530 | 3.5 | 4.0 | 4.0 | PatrickJR | American Barleywine | 3.5 | 3.0 | Talon - True Style Barley Wine Ale | 10.5 | 16439 |
| 9 | 1472182 | 1086 | Cleveland ChopHouse And Brewery | 1170090653 | 4.5 | 4.0 | 4.0 | mattcrill | Belgian Strong Pale Ale | 4.0 | 4.0 | Hop Diablo | NaN | 35010 |
Last rows
| Unnamed: 0 | brewery_id | brewery_name | review_time | review_overall | review_aroma | review_appearance | review_profilename | beer_style | review_palate | review_taste | beer_name | beer_abv | beer_beerid | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 15856 | 1013003 | 143 | Spoetzl Brewery | 1235428401 | 3.5 | 3.5 | 3.5 | Vengeance526 | Vienna Lager | 3.5 | 3.5 | Shiner 98 Bavarian Style Amber | NaN | 36866 |
| 15857 | 674390 | 14400 | Ninkasi Brewing Company | 1250638883 | 4.0 | 3.0 | 4.0 | morimech | American Pale Ale (APA) | 4.0 | 3.5 | Radiant Summer Ale | 6.0 | 50271 |
| 15858 | 1239111 | 494 | Malt Shovel Brewery | 1092280686 | 3.5 | 4.0 | 4.0 | Zorro | English Porter | 3.5 | 3.5 | James Squire Porter | 5.0 | 2078 |
| 15859 | 282905 | 35 | Boston Beer Company (Samuel Adams) | 1250169053 | 4.0 | 4.0 | 4.0 | ghebb | Vienna Lager | 4.0 | 4.0 | Samuel Adams Boston Lager | 4.9 | 104 |
| 15860 | 263580 | 68 | Flying Dog Brewery | 1284672178 | 3.0 | 3.5 | 4.5 | EricCioe | American Double / Imperial IPA | 4.0 | 4.5 | Double Dog Double Pale Ale | 11.5 | 35754 |
| 15861 | 69060 | 466 | South African Breweries plc | 1309552188 | 4.0 | 3.5 | 4.5 | katan | Milk / Sweet Stout | 3.5 | 3.5 | Castle Milk Stout | 6.0 | 7371 |
| 15862 | 618382 | 10097 | Harpoon Brewery | 1309836484 | 2.0 | 2.0 | 4.0 | SWMeyer4141 | American Pale Wheat Ale | 1.5 | 2.0 | UFO Hefeweizen | 5.1 | 318 |
| 15863 | 904258 | 142 | Spaten-Franziskaner-Bräu | 1297177461 | 4.5 | 4.0 | 4.0 | CuriousMonk | Hefeweizen | 4.0 | 4.5 | Franziskaner Hefe-Weisse | 5.0 | 1946 |
| 15864 | 1035686 | 39 | Privatbrauerei Franz Inselkammer KG / Brauerei Aying | 1286336151 | 4.0 | 3.0 | 3.5 | TheQuietMan22 | Dortmunder / Export Lager | 3.5 | 4.0 | Ayinger Jahrhundert Bier | 5.5 | 133 |
| 15865 | 1135428 | 863 | Russian River Brewing Company | 1297447950 | 5.0 | 4.5 | 4.5 | SupaCelt | American Double / Imperial IPA | 4.5 | 4.5 | Pliny The Elder | 8.0 | 7971 |